A Two Phase Method for Information Extraction

نویسنده

  • Vesna Pajić
چکیده

In biology and functional genomics in particular, understanding the dependence and interplay between different genome and ecological characteristics of organisms is a very challenging problem. There are some public databases which combine this kind of information, but there is still much more information about microbes and other organisms that reside in unstructured and semi-structured documents, such as encyclopaedias. In this paper we present a method for extracting information from semi-structured resources, such as encyclopaedias, based on finite state transducers, consisting of two clearly distinguished phases. The first phase strongly relies on the analysis of the document structure and it is used for locating records of data in the text. The second phase is based on the finite state transducers created for extracting the data, which can be modified so as to achieve the preferred efficiency and it is used for extracting the particular characteristic from the text. We show how the two phase method is applied to the text of the encyclopaedia “Systematic Bacteriology”. A fully structured database with genotype and phenotype characteristics of organisms has been created from the encyclopaedia unstructured descriptions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Disguised Face Recognition by Using Local Phase Quantization and Singular Value Decomposition

Disguised face recognition is a major challenge in the field of face recognition which has been taken less attention. Therefore, in this paper a disguised face recognition algorithm based on Local Phase Quantization (LPQ) method and Singular Value Decomposition (SVD) is presented which deals with two main challenges. The first challenge is when an individual intentionally alters the appearance ...

متن کامل

Physical separation of amphiprotic-polar aprotic solvents for simultaneous extraction and clean-up of clomiphene from plasma before liquid chromatographic analyzes

An efficient and quantitative two phase freezing (TPF) method coupled with high performance liquid chromatography and UV-Vis detector was developed for the extraction, clean up and determination of clomiphene citrate (CLC) in plasma samples. The separation of two miscible solvents by TPF method permits that the CLC was efficiently removed from proteins and transferred into the relative aprotic ...

متن کامل

Physical separation of amphiprotic-polar aprotic solvents for simultaneous extraction and clean-up of clomiphene from plasma before liquid chromatographic analyzes

An efficient and quantitative two phase freezing (TPF) method coupled with high performance liquid chromatography and UV-Vis detector was developed for the extraction, clean up and determination of clomiphene citrate (CLC) in plasma samples. The separation of two miscible solvents by TPF method permits that the CLC was efficiently removed from proteins and transferred into the relative aprotic ...

متن کامل

Extraction of Cephalexin Using Aqueous Two-Phase Systems Composed of Cholinium Chloride and K3PO4

Cephalexin is an important antibiotic. It is very significant to determine an appropriate method for the extraction of this valuable antibiotic for industrial applications. Aiming at developing an efficient method for the extraction of cephalexin, the partitioning of cephalexin has been evaluated in aqueous two-phase system, including cholinium chloride and potassium phosphate. The effect of th...

متن کامل

Determination of Two Antiepileptic Drugs in Urine by Homogenous Liquid-Liquid Extraction Performed in A Narrow Tube Combined With Dispersive Liquid-liquid Microextraction Followed by Gas Chromatography-flame Ionization Detection

A simple and efficient homogenous liquid-liquid extraction method performed in a narrow tube combined with dispersive liquid-liquid microextraction method has been presented for the simultaneous determination of two antiepileptic drugs in urine followed by gas chromatography with flame ionization detection. In this method, a mixture of acetonitrile and urine sample (homogenous solution) is load...

متن کامل

Determination of Two Antiepileptic Drugs in Urine by Homogenous Liquid-Liquid Extraction Performed in A Narrow Tube Combined With Dispersive Liquid-liquid Microextraction Followed by Gas Chromatography-flame Ionization Detection

A simple and efficient homogenous liquid-liquid extraction method performed in a narrow tube combined with dispersive liquid-liquid microextraction method has been presented for the simultaneous determination of two antiepileptic drugs in urine followed by gas chromatography with flame ionization detection. In this method, a mixture of acetonitrile and urine sample (homogenous solution) is load...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011